On The Hereditary Discrepancy of Homogeneous Arithmetic Progressions

Author: Nikolov Aleksandar
Talwar Kunal
Publication date: 08/04/2015

We show that the hereditary discrepancy of homogeneous arithmetic progressions is lower bounded by $n^{1/O(\log \log n)}$. This bound is tight up to the constant in the exponent. Our lower bound goes via proving an exponential lower bound on the discrepancy of set systems of subcubes of the Boolean cube $\{0, 1\}^d$.

Comment: To appear in the Proceedings of the American Mathematical Society.

arXiv.org e-Print Archive

CiteSeerX

Approximating Hereditary Discrepancy via Small Width Ellipsoids

Author: Nikolov Aleksandar
Talwar Kunal
Publication date: 23/07/2014

The discrepancy of a hypergraph is the minimum attainable value, over two-colorings of its vertices, of the maximum absolute imbalance of any hyperedge. The hereditary discrepancy of a hypergraph, defined as the maximum discrepancy of a restriction of the hypergraph to a subset of its vertices, is a measure of its complexity. Lovász, Spencer and Vesztergombi (1986) related the natural extension of this quantity to matrices to rounding algorithms for linear programs, and gave a determinant-based lower bound on the hereditary discrepancy. Matoušek (2011) showed that this bound is tight up to a polylogarithmic factor, leaving open the question of actually computing this bound. Recent work by Nikolov, Talwar and Zhang (2013) showed a polynomial time $\tilde{O}(\log^3 n)$-approximation to hereditary discrepancy, as a by-product of their work in differential privacy. In this paper, we give a direct simple $O(\log^{3/2} n)$-approximation algorithm for this problem. We show that up to this approximation factor, the hereditary discrepancy of a matrix $A$ is characterized by the optimal value of a simple geometric convex program that seeks to minimize the largest $\ell_{\infty}$ norm of any point in an ellipsoid containing the columns of $A$. This characterization promises to be a useful tool in discrepancy theory.
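The two definitions at the start of this abstract can be made concrete with a brute-force sketch. This is only an illustration of the definitions, not the approximation algorithm of the paper: it enumerates all colorings and all vertex subsets, so it runs in exponential time and is usable only on tiny hypergraphs. The function names `discrepancy` and `hereditary_discrepancy` are hypothetical.

```python
from itertools import combinations, product


def discrepancy(edges, vertices):
    """Minimum, over +1/-1 colorings of the vertices, of the maximum
    absolute imbalance of any hyperedge (brute force)."""
    vertices = list(vertices)
    if not vertices:
        return 0
    best = None
    for signs in product((-1, 1), repeat=len(vertices)):
        color = dict(zip(vertices, signs))
        # Imbalance of an edge is |sum of the colors of its vertices|.
        worst = max((abs(sum(color[v] for v in e)) for e in edges), default=0)
        best = worst if best is None else min(best, worst)
    return best


def hereditary_discrepancy(edges, vertices):
    """Maximum discrepancy over all restrictions of the hypergraph
    to a nonempty subset of its vertices (brute force)."""
    vertices = list(vertices)
    best = 0
    for r in range(1, len(vertices) + 1):
        for subset in combinations(vertices, r):
            keep = set(subset)
            # Restrict every hyperedge to the chosen vertex subset.
            restricted = [[v for v in e if v in keep] for e in edges]
            best = max(best, discrepancy(restricted, subset))
    return best


if __name__ == "__main__":
    single_edge = [[1, 2, 3, 4]]
    print(discrepancy(single_edge, [1, 2, 3, 4]))
    print(hereditary_discrepancy(single_edge, [1, 2, 3, 4]))
```

A single hyperedge on four vertices shows why the hereditary version is the more robust measure: the edge can be perfectly balanced, but restricting it to one vertex forces an imbalance of 1.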

arXiv.org e-Print Archive

Crossref

Efficient Algorithms for Privately Releasing Marginals via Convex Relaxations

Author: Dwork Cynthia
Nikolov Aleksandar
Talwar Kunal
Publication date: 06/08/2013

Consider a database of $n$ people, each represented by a bit-string of length $d$ corresponding to the setting of $d$ binary attributes. A $k$-way marginal query is specified by a subset $S$ of $k$ attributes, and a $|S|$-dimensional binary vector $\beta$ specifying their values. The result for this query is a count of the number of people in the database whose attribute vector restricted to $S$ agrees with $\beta$. Privately releasing approximate answers to a set of $k$-way marginal queries is one of the most important and well-motivated problems in differential privacy. Information theoretically, the error complexity of marginal queries is well-understood: the per-query additive error is known to be at least $\Omega(\min\{\sqrt{n},d^{\frac{k}{2}}\})$ and at most $\tilde{O}(\min\{\sqrt{n} d^{1/4},d^{\frac{k}{2}}\})$. However, no polynomial time algorithm with error complexity as low as the information theoretic upper bound is known for small $n$. In this work we present a polynomial time algorithm that, for any distribution on marginal queries, achieves average error at most $\tilde{O}(\sqrt{n} d^{\frac{\lceil k/2 \rceil}{4}})$. This error bound is as good as the best known information theoretic upper bounds for $k=2$. This bound is an improvement over previous work on efficiently releasing marginals when $k$ is small and when error $o(n)$ is desirable. Using private boosting we are also able to give nearly matching worst-case error bounds. Our algorithms are based on the geometric techniques of Nikolov, Talwar, and Zhang. The main new ingredients are convex relaxations and careful use of the Frank-Wolfe algorithm for constrained convex minimization. To design our relaxations, we rely on the Grothendieck inequality from functional analysis.
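The definition of a $k$-way marginal query in this abstract can be illustrated with a short sketch. It computes only the exact, non-private count; the private release mechanism of the paper is not reproduced here. The function name `marginal_query`, the 0-based attribute indexing, and the toy database are assumptions made for illustration.

```python
def marginal_query(database, S, beta):
    """Exact (non-private) answer to a k-way marginal query.

    database: iterable of rows, each a sequence of d bits.
    S:        attribute positions (0-based, an assumption of this sketch).
    beta:     required bit values, one per position in S.
    Returns the number of rows whose attributes restricted to S equal beta.
    """
    if len(S) != len(beta):
        raise ValueError("S and beta must have the same length")
    return sum(all(row[i] == b for i, b in zip(S, beta)) for row in database)


if __name__ == "__main__":
    # Hypothetical database: n = 4 people, d = 3 binary attributes.
    db = [(1, 0, 1), (1, 1, 1), (0, 0, 1), (1, 0, 0)]
    # 2-way marginal: attributes 0 and 2 must both equal 1.
    print(marginal_query(db, S=(0, 2), beta=(1, 1)))  # prints 2
```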

arXiv.org e-Print Archive

CiteSeerX

The Geometry of Differential Privacy: the Sparse and Approximate Cases

Author: Nikolov Aleksandar
Talwar Kunal
Zhang Li
Publication venue: Association for Computing Machinery (ACM)
Publication date: 03/12/2012

In this work, we study trade-offs between accuracy and privacy in the context of linear queries over histograms. This is a rich class of queries that includes contingency tables and range queries, and has been a focus of a long line of work. For a set of $d$ linear queries over a database $x \in \mathbb{R}^N$, we seek to find the differentially private mechanism that has the minimum mean squared error. For pure differential privacy, an $O(\log^2 d)$ approximation to the optimal mechanism is known. Our first contribution is to give an $O(\log^2 d)$ approximation guarantee for the case of $(\varepsilon,\delta)$-differential privacy. Our mechanism is simple, efficient and adds correlated Gaussian noise to the answers. We prove its approximation guarantee relative to the hereditary discrepancy lower bound of Muthukrishnan and Nikolov, using tools from convex geometry. We next consider this question in the case when the number of queries exceeds the number of individuals in the database, i.e. when $d > n \triangleq \|x\|_1$. It is known that better mechanisms exist in this setting. Our second main contribution is to give an $(\varepsilon,\delta)$-differentially private mechanism which is optimal up to a $\mathrm{polylog}(d,N)$ factor for any given query set $A$ and any given upper bound $n$ on $\|x\|_1$. This approximation is achieved by coupling the Gaussian noise addition approach with a linear regression step. We give an analogous result for the $\varepsilon$-differential privacy setting. We also improve on the mean squared error upper bound for answering counting queries on a database of size $n$ by Blum, Ligett, and Roth, and match the lower bound implied by the work of Dinur and Nissim up to logarithmic factors. The connection between hereditary discrepancy and the privacy mechanism enables us to derive the first polylogarithmic approximation to the hereditary discrepancy of a matrix $A$.

arXiv.org e-Print Archive

Crossref

Towards a Constructive Version of Banaszczyk's Vector Balancing Theorem

Author: Dadush Daniel
Garg Shashwat
Lovett Shachar
Nikolov Aleksandar
Publication date: 01/01/2016

An important theorem of Banaszczyk (Random Structures & Algorithms '98) states that for any sequence of vectors of $\ell_2$ norm at most $1/5$ and any convex body $K$ of Gaussian measure $1/2$ in $\mathbb{R}^n$, there exists a signed combination of these vectors which lands inside $K$. A major open problem is to devise a constructive version of Banaszczyk's vector balancing theorem, i.e. to find an efficient algorithm which constructs the signed combination. We make progress towards this goal along several fronts. As our first contribution, we show an equivalence between Banaszczyk's theorem and the existence of $O(1)$-subgaussian distributions over signed combinations. For the case of symmetric convex bodies, our equivalence implies the existence of a universal signing algorithm (i.e. independent of the body), which simply samples from the subgaussian sign distribution and checks to see if the associated combination lands inside the body. For asymmetric convex bodies, we provide a novel recentering procedure, which allows us to reduce to the case where the body is symmetric. As our second main contribution, we show that the above framework can be efficiently implemented when the vectors have length $O(1/\sqrt{\log n})$, recovering Banaszczyk's results under this stronger assumption. More precisely, we use random walk techniques to produce the required $O(1)$-subgaussian signing distributions when the vectors have length $O(1/\sqrt{\log n})$, and use a stochastic gradient ascent method to implement the recentering procedure for asymmetric bodies.
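The "sample and check" structure of the universal signing algorithm described in this abstract can be sketched as follows. This is a minimal sketch under loud assumptions: it draws independent uniform random signs as a stand-in, which is not the $O(1)$-subgaussian signing distribution constructed in the paper, and the body is given by a hypothetical membership oracle `in_body`. The names `signed_sum`, `sample_and_check`, and `in_cube` are illustrative only.

```python
import random


def signed_sum(vectors, signs):
    """Return the signed combination sum_i signs[i] * vectors[i]."""
    dim = len(vectors[0])
    return tuple(sum(s * v[i] for s, v in zip(signs, vectors)) for i in range(dim))


def sample_and_check(vectors, in_body, trials=1000, seed=0):
    """Sample sign vectors and return the first whose signed combination
    lands inside the body, or None if no trial succeeds.

    Assumption: signs are uniform and independent here, standing in for
    the subgaussian signing distribution, which this sketch does not build.
    """
    rng = random.Random(seed)
    for _ in range(trials):
        signs = [rng.choice((-1, 1)) for _ in vectors]
        if in_body(signed_sum(vectors, signs)):
            return signs
    return None


if __name__ == "__main__":
    # Hypothetical symmetric body: the cube [-1, 1]^2.
    def in_cube(point):
        return all(abs(c) <= 1.0 for c in point)

    vecs = [(0.2, 0.0), (0.0, 0.2), (0.1, 0.1)]
    print(sample_and_check(vecs, in_cube))
```

The loop does not depend on the body beyond the membership test, which mirrors the sense in which the abstract calls the signing algorithm universal.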

arXiv.org e-Print Archive

Repository TU/e

CWI's Institutional Repository